F0 feature extraction by polynomial regression function for monosyllabic Thai tone recognition
نویسندگان
چکیده
This paper presents a monosyllabic Thai tone recognition system. The system is composed of three main processes, fundamental frequency (F0) extraction from input speech signal, analysis of F0 contour for feature extraction, and classification of each tone using the extracted features. In the F0 feature extraction, the polynomial regression functions are employed to fit the segmented F0 curve where its coefficients are used as a feature vector. In tone recognition, we used the maximum a posteriori probability classifier (MAP) to classify a tone by assuming that the feature is a multidimensional Gaussian random variable. The hypothetical words used in this paper are composed of numerical words and monosyllabic Thai words. The vocabulary set is composed of the short vowel words, the long vowel words and have the effect of initial and final consonant on the shape of F0 contour. The experimental results show that by using the system as a speaker-dependent system, the maximum recognition rate is 96.20% using three-dimension feature vector. The speakerindependent recognition rates are 79.99% for male and 82.80% for female using four-dimension feature vector.
منابع مشابه
Monosyllabic Thai Tone Recognition Using Ant-Miner Algorithm
Recognition of tone is essential for speech recognition and language understanding. A monosyllabic Thai tone recognition system, which is based on the Ant-Miner algorithm. The system is composed of three main process, fundamental frequency (F0) extraction from input speech signal, analysis of F0 contour for feature extraction, In the F0 feature extraction, the polynomial regression functions ar...
متن کاملA VLSI Architecture of Tone Classification Function-Based Isolated-Word Speech Recognition
Speech recognition in tonal languages such as Thai, Chinese, etc. classifies word meaning by using tone. Therefore tone classification function is extremely essential part for improving accuracy rate. This paper presents a novel VLSI architecture of tone classification function-based isolated word speech recognition. The architecture consists of two parts; feature extraction and tone classifica...
متن کاملTone recognition in Thai continuous speech based on coarticulaion, intonation and stress effects
Tone recognition is a critical component for speech recognition in a tone language. One of the main problems of tone recognition in continuous speech is that several interacting factors affect F0 realization of tones. In this paper, we focus on the coarticulatory, intonation, and stress effects. These effects are compensated by the tone information of neighboring syllables, the adjustment of F0...
متن کاملClassification of Thai consonant naming using Thai tone
This paper proposes the novel technique for separation of Thai consonant naming or consonant spelling using its tones. Consonant spelling is used for many applications such as a voice-actuated typewriter that helping to correct the confusable word in sound. Because fundamental frequency (F0) can be suitably used in tone classification for Thai speech recognition, which is tonal language of five...
متن کاملLaryngealization and features for Chinese tonal recognition
It is well known that the lowest tone in Mandarin, a language without contrastive phonation, often co-occurs with laryngealization/creaky voice quality, and we provide evidence that this is also the case for the lowest tone in Cantonese. However, the effects of laryngealization on f0 feature extraction for tonal recognition, as well as the potential of laryngealization as a feature for improvin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001